SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: DeBonte, Lorin R. 

Fan, Zhegong 
Loh, Willie H-T. 
Shorrosh, Basil S.

(ii) TITLE OF THE INVENTION: FATTY ACID DESATURASES AND MUTANT SEQUENCES THEREOF

(iii) NUMBER OF SEQUENCES: 28 

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Fish & Richardson P.C., P.A.

(B) STREET: 60 South Sixth Street, Suite 3300 

(C) CITY: Minneapolis 

(D) STATE: MN 

(E) COUNTRY: USA 

(F) ZIP: 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 95

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 08/874,109 

(B) FILING DATE: 12-JUN-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: PCT/US96/20090 

(B) FILING DATE: 13-DEC-1996 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Ronald C. Lundquist 

(B) REGISTRATION NUMBER: 37,875 

(C) REFERENCE/DOCKET NUMBER: 07148/063001

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612/335-5070 

(B) TELEFAX: 612/288-9696 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Brassica napus 

(ix) FEATURE:

(D) OTHER INFORMATION: Wild type Fad2.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAG AAG TCT 48
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

GAA ACC GAC ACC ATC AAG CGC GTA CCC TGC GAG ACA CCG CCC TTC ACT 96
Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

GTC GGA GAA CTC AAG AAA GCA ATC CCA CCG CAC TGT TTC AAA CGC TCG 144
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

ATC CCT CGC TCT TTC TCC TAC CTC ATC TGG GAC ATC ATC ATA GCC TCC 192
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

TGC TTC TAC TAC NTC GCC ACC ACT TAC TTC CCT CTC CTC CCT CAC CCT 240
Cys Phe Tyr Tyr Xaa Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

CTC TCC TAC TTC GCC TGG CCT CTC TAC TGG GCC TGC CAA GGG TGC GTC 288
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

CTA ACC GGC GTC TGG GTC ATA GCC CAC GAA TGC GGC CAC CAC GCC TTC 336
Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

AGC GAC TAC CAG TGG CTT GAC GAC ACC GTC GGT CTC ATC TTC CAC TCC 384
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

TTC CTC CTC GTC CCT TAC TTC TCC TGG AAG TAC AGT CAT CGC AGC CAC 432
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Ser His
130 135 140

CAT TCC AAC ACT GGC TCC CTC GAG AGA GAC GAA GTG TTT GTC CCC AAG 480
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

AAG AAG TCA GAC ATC AAG TGG TAC GGC AAG TAC CTC AAC AAC CCT TTG 528
Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

GGA CGC ACC GTG ATG TTA ACG GTT CAG TTC ACT CTC GGC TGG CCG TTG 576
Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

TAC TTA GCC TTC AAC GTC TCG GGA AGA CCT TAC GAC GGC GGC TTC CGT 624
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Arg
195 200 205

TGC CAT TTC CAC CCC AAC GCT CCC ATC TAC AAC GAC CGC GAG CGT CTC 672
Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

CAG ATA TAC ATC TCC GAC GCT GGC ATC CTC GCC GTC TGC TAC GGT CTC 720
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

TTC CGT TAC GCC GCC GGC CAG GGA GTG GCC TCG ATG GTC TGC TTC TAC 768
Phe Arg Tyr Ala Ala Gly Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

GGA GTC CCG CTT CTG ATT GTC AAT GGT TTC CTC GTG TTG ATC ACT TAC 816
Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

TTG CAG CAC ACG CAT CCT TCC CTG CCT CAC TAC GAT TCG TCC GAG TGG 864
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

GAT TGG TTC AGG GGA GCT TTG GCT ACC GTT GAC AGA GAC TAC GGA ATC 912
Asp Trp Phe Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

TTG AAC AAG GTC TTC CAC AAT ATT ACC GAC ACG CAC GTG GCC CAT CAT 960
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

CCG TTC TCC ACG ATG CCG CAT TAT CAC GCG ATG GAA GCT ACC AAG GCG 1008
Pro Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

ATA AAG CCG ATA CTG GGA GAG TAT TAT CAG TTC GAT GGG ACG CCG GTG 1056
Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATC TAT GTG GAA CCG 1104
Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

GAC AGG CAA GGT GAG AAG AAA GGT GTG TTC TGG TAC AAC AAT AAG TTA T 1153
Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

GA 1155

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

Cys Phe Tyr Tyr Xaa Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Ser His
130 135 140

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Arg
195 200 205

Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

Phe Arg Tyr Ala Ala Gly Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

Asp Trp Phe Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

Pro Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380 
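
The amino acid rows of SEQ ID NO:2 correspond codon by codon to the nucleotide rows of SEQ ID NO:1. As a rough cross-check, the following Python sketch translates one row of codons into three-letter residue names. It assumes the standard genetic code; the function name translate_row and the use of "Ter" for a stop codon are illustrative choices and are not part of the listing.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_1[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",  # assumption: label stop codons as "Ter"
}


def translate_row(row):
    """Translate a whitespace-separated row of codons into three-letter residues.

    Tokens that are not three letters long (running counts such as 48, or a
    trailing partial codon) are skipped. Codons containing an ambiguous base
    such as N are reported as Xaa.
    """
    residues = []
    for token in row.upper().split():
        if len(token) != 3 or not token.isalpha():
            continue
        one_letter = CODON_TABLE.get(token)
        residues.append(THREE_LETTER[one_letter] if one_letter else "Xaa")
    return " ".join(residues)


# First row of SEQ ID NO:1, including its running nucleotide count.
print(translate_row("ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAG AAG TCT 48"))
```

Under these assumptions, a row whose translation disagrees with the printed residues would point to a transcription problem in either the nucleotide or the amino acid line.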

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(ix) FEATURE: 

(D) OTHER INFORMATION: G to A transversion mutation at nucleotide 316.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAG AAG TCT 48
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

GAA ACC GAC ACC ATC AAG CGC GTA CCC TGC GAG ACA CCG CCC TTC ACT 96
Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

GTC GGA GAA CTC AAG AAA GCA ATC CCA CCG CAC TGT TTC AAA CGC TCG 144
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

ATC CCT CGC TCT TTC TCC TAC CTC ATC TGG GAC ATC ATC ATA GCC TCC 192
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

TGC TTC TAC TAC NTC GCC ACC ACT TAC TTC CCT CTC CTC CCT CAC CCT 240
Cys Phe Tyr Tyr Xaa Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

CTC TCC TAC TTC GCC TGG CCT CTC TAC TGG GCC TGC CAA GGG TGC GTC 288
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

CTA ACC GGC GTC TGG GTC ATA GCC CAC AAG TGC GGC CAC CAC GCC TTC 336
Leu Thr Gly Val Trp Val Ile Ala His Lys Cys Gly His His Ala Phe
100 105 110

AGC GAC TAC CAG TGG CTT GAC GAC ACC GTC GGT CTC ATC TTC CAC TCC 384
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

TTC CTC CTC GTC CCT TAC TTC TCC TGG AAG TAC AGT CAT CGC AGC CAC 432
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Ser His
130 135 140

CAT TCC AAC ACT GGC TCC CTC GAG AGA GAC GAA GTG TTT GTC CCC AAG 480
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

AAG AAG TCA GAC ATC AAG TGG TAC GGC AAG TAC CTC AAC AAC CCT TTG 528
Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

GGA CGC ACC GTG ATG TTA ACG GTT CAG TTC ACT CTC GGC TGG CCG TTG 576
Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

TAC TTA GCC TTC AAC GTC TCG GGA AGA CCT TAC GAC GGC GGC TTC CGT 624
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Arg
195 200 205

TGC CAT TTC CAC CCC AAC GCT CCC ATC TAC AAC GAC CGC GAG CGT CTC 672
Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

CAG ATA TAC ATC TCC GAC GCT GGC ATC CTC GCC GTC TGC TAC GGT CTC 720
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

TTC CGT TAC GCC GCC GGC CAG GGA GTG GCC TCG ATG GTC TGC TTC TAC 768
Phe Arg Tyr Ala Ala Gly Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

GGA GTC CCG CTT CTG ATT GTC AAT GGT TTC CTC GTG TTG ATC ACT TAC 816
Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

TTG CAG CAC ACG CAT CCT TCC CTG CCT CAC TAC GAT TCG TCC GAG TGG 864
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

GAT TGG TTC AGG GGA GCT TTG GCT ACC GTT GAC AGA GAC TAC GGA ATC 912
Asp Trp Phe Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

TTG AAC AAG GTC TTC CAC AAT ATT ACC GAC ACG CAC GTG GCC CAT CAT 960
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

CCG TTC TCC ACG ATG CCG CAT TAT CAC GCG ATG GAA GCT ACC AAG GCG 1008
Pro Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

ATA AAG CCG ATA CTG GGA GAG TAT TAT CAG TTC GAT GGG ACG CCG GTG 1056
Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATC TAT GTG GAA CCG 1104
Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

GAC AGG CAA GGT GAG AAG AAA GGT GTG TTC TGG TAC AAC AAT AAG TTA T 1153
Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

GA 
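
The FEATURE entries in this listing give mutation sites as nucleotide positions (316 for SEQ ID NO:3), while the rows are laid out as 16 codons with a running count at the end of each row. A small Python sketch of the arithmetic that relates the two is shown below. The function names locate and row_end_count are hypothetical helpers, and the sketch assumes 1-based positions with the reading frame starting at nucleotide 1.

```python
def locate(nt_pos):
    """Map a 1-based nucleotide position to (codon number, base within codon).

    Both returned values are 1-based and assume the frame starts at position 1.
    """
    if nt_pos < 1:
        raise ValueError("nucleotide positions are 1-based")
    codon_index, offset = divmod(nt_pos - 1, 3)
    return codon_index + 1, offset + 1


def row_end_count(nt_pos, codons_per_row=16):
    """Return the running count printed at the end of the row holding nt_pos.

    Assumes every row carries codons_per_row complete codons, which holds for
    all rows in this layout except the final one.
    """
    row_length = 3 * codons_per_row
    return ((nt_pos - 1) // row_length + 1) * row_length


print(locate(316), row_end_count(316))  # site given for SEQ ID NO:3
print(locate(515), row_end_count(515))  # site given for SEQ ID NO:7
```

With these assumptions, nucleotide 316 is the first base of codon 106, in the row that ends at 336, and nucleotide 515 is the second base of codon 172, in the row that ends at 528.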

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

Cys Phe Tyr Tyr Xaa Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

Leu Thr Gly Val Trp Val Ile Ala His Lys Cys Gly His His Ala Phe
100 105 110

Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Ser His
130 135 140

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Arg
195 200 205

Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

Phe Arg Tyr Ala Ala Gly Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

Asp Trp Phe Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

Pro Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380 
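
SEQ ID NO:4 differs from SEQ ID NO:2 only in the row covering residues 97 to 112. The Python sketch below shows one way to locate such a difference by comparing two aligned rows of three-letter residues. The function name diff_residues is a hypothetical helper, and the sketch assumes both rows start at the same residue number and have equal length.

```python
def diff_residues(row_a, row_b, first_residue):
    """Compare two aligned rows of three-letter residues.

    Returns a list of (residue number, residue in row_a, residue in row_b)
    for every position where the two rows disagree.
    """
    a, b = row_a.split(), row_b.split()
    if len(a) != len(b):
        raise ValueError("rows must contain the same number of residues")
    return [
        (first_residue + i, x, y)
        for i, (x, y) in enumerate(zip(a, b))
        if x != y
    ]


# Residues 97 to 112 as listed for SEQ ID NO:2 and SEQ ID NO:4.
seq2_row = "Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe"
seq4_row = "Leu Thr Gly Val Trp Val Ile Ala His Lys Cys Gly His His Ala Phe"
print(diff_residues(seq2_row, seq4_row, 97))
```

For these two rows the only reported difference is at residue 106, Glu in SEQ ID NO:2 and Lys in SEQ ID NO:4.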

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brassica napus 

(ix) FEATURE: 

(D) OTHER INFORMATION: Wild type Fad2.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAA AAG TCT 48
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

GAA ACC GAC AAC ATC AAG CGC GTA CCC TGC GAG ACA CCG CCC TTC ACT 96
Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

GTC GGA GAA CTC AAG AAA GCA ATC CCA CCG CAC TGT TTC AAA CGC TCG 144
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

ATC CCT CGC TCT TTC TCC TAC CTC ATC TGG GAC ATC ATC ATA GCC TCC 192
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

TGC TTC TAC TAC GTC GCC ACC ACT TAC TTC CCT CTC CTC CCT CAC CCT 240
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

CTC TCC TAC TTC GCC TGG CCT CTC TAC TGG GCC TGC CAG GGC TGC GTC 288
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

CTA ACC GGC GTC TGG GTC ATA GCC CAC GAG TGC GGC CAC CAC GCC TTC 336
Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

AGC GAC TAC CAG TGG CTG GAC GAC ACC GTC GGC CTC ATC TTC CAC TCC 384
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

TTC CTC CTC GTC CCT TAC TTC TCC TGG AAG TAC AGT CAT CGA CGC CAC 432
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

CAT TCC AAC ACT GGC TCC CTC GAG AGA GAC GAA GTG TTT GTC CCC AAG 480
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

AAG AAG TCA GAC ATC AAG TGG TAC GGC AAG TAC CTC AAC AAC CCT TTG 528
Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

GGA CGC ACC GTG ATG TTA ACG GTT CAG TTC ACT CTC GGC TGG CCT TTG 576
Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

TAC TTA GCC TTC AAC GTC TCG GGG AGA CCT TAC GAC GGC GGC TTC GCT 624
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

TGC CAT TTC CAC CCC AAC GCT CCC ATC TAC AAC GAC CGC GAG CGT CTC 672
Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

CAG ATA TAC ATC TCC GAC GCT GGC ATC CTC GCC GTC TGC TAC GGT CTC 720
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

TAC CGC TAC GCT GCT GTC CAA GGA GTT GCC TCG ATG GTC TGC TTC TAC 768
Tyr Arg Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

GGA GTT CCG CTT CTG ATT GTC AAT GGG TTC TTA GTT TTG ATC ACT TAC 816
Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

TTG CAG CAC ACG CAT CCT TCC CTG CCT CAC TAT GAC TCG TCT GAG TGG 864
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

GAT TGG TTG AGG GGA GCT TTG GCC ACC GTT GAC AGA GAC TAC GGA ATC 912
Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

TTG AAC AAG GTC TTC CAC AAT ATC ACG GAC ACG CAC GTG GCG CAT CAC 960
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

CTG TTC TCG ACC ATG CCG CAT TAT CAT GCG ATG GAA GCT ACG AAG GCG 1008
Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

ATA AAG CCG ATA CTG GGA GAG TAT TAT CAG TTG CAT GGG ACG CCG GTG 1056
Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Leu His Gly Thr Pro Val
340 345 350

GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATC TAT GTG GAA CCG 1104
Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

GAC AGG CAA GGT GAG AAG AAA GGT GTG TTC TGG TAC AAC AAT AAG TTA T 1153
Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

GA 1155

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 384 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

Tyr Arg Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Leu His Gly Thr Pro Val
340 345 350

Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Brassica napus 

(ix) FEATURE:

(D) OTHER INFORMATION: T to A transversion mutation at nucleotide 515.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAA AAG TCT 48
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

GAA ACC GAC AAC ATC AAG CGC GTA CCC TGC GAG ACA CCG CCC TTC ACT 96
Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

GTC GGA GAA CTC AAG AAA GCA ATC CCA CCG CAC TGT TTC AAA CGC TCG 144
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

ATC CCT CGC TCT TTC TCC TAC CTC ATC TGG GAC ATC ATC ATA GCC TCC 192
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

TGC TTC TAC TAC GTC GCC ACC ACT TAC TTC CCT CTC CTC CCT CAC CCT 240
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

CTC TCC TAC TTC GCC TGG CCT CTC TAC TGG GCC TGC CAG GGC TGC GTC 288
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

CTA ACC GGC GTC TGG GTC ATA GCC CAC GAG TGC GGC CAC CAC GCC TTC 336
Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

AGC GAC TAC CAG TGG CTG GAC GAC ACC GTC GGC CTC ATC TTC CAC TCC 384
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

TTC CTC CTC GTC CCT TAC TTC TCC TGG AAG TAC AGT CAT CGA CGC CAC 432
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

CAT TCC AAC ACT GGC TCC CTC GAG AGA GAC GAA GTG TTT GTC CCC AAG 480
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

AAG AAG TCA GAC ATC AAG TGG TAC GGC AAG TAC CAC AAC AAC CCT TTG 528
Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr His Asn Asn Pro Leu
165 170 175

GGA CGC ACC GTG ATG TTA ACG GTT CAG TTC ACT CTC GGC TGG CCT TTG 576
Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

TAC TTA GCC TTC AAC GTC TCG GGG AGA CCT TAC GAC GGC GGC TTC GCT 624
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

TGC CAT TTC CAC CCC AAC GCT CCC ATC TAC AAC GAC CGC GAG CGT CTC 672
Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

CAG ATA TAC ATC TCC GAC GCT GGC ATC CTC GCC GTC TGC TAC GGT CTC 720
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

TAC CGC TAC GCT GCT GTC CAA GGA GTT GCC TCG ATG GTC TGC TTC TAC 768
Tyr Arg Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

GGA GTT CCG CTT CTG ATT GTC AAT GGG TTC TTA GTT TTG ATC ACT TAC 816
Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

TTG CAG CAC ACG CAT CCT TCC CTG CCT CAC TAT GAC TCG TCT GAG TGG 864
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

GAT TGG TTG AGG GGA GCT TTG GCC ACC GTT GAC AGA GAC TAC GGA ATC 912
Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

TTG AAC AAG GTC TTC CAC AAT ATC ACG GAC ACG CAC GTG GCG CAT CAC 960
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

CTG TTC TCG ACC ATG CCG CAT TAT CAT GCG ATG GAA GCT ACG AAG GCG 1008
Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

ATA AAG CCG ATA CTG GGA GAG TAT TAT CAG TTG CAT GGG ACG CCG GTG 1056
Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Leu His Gly Thr Pro Val
340 345 350

GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATC TAT GTG GAA CCG 1104
Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

GAC AGG CAA GGT GAG AAG AAA GGT GTG TTC TGG TAC AAC AAT AAG TTA T 1153
Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

GA 1155

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr His Asn Asn Pro Leu
165 170 175

Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

Tyr Arg Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Leu His Gly Thr Pro Val
340 345 350

Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1152
(D) OTHER INFORMATION: 
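
The figures given for SEQ ID NO:9 (LENGTH 1155 base pairs, coding sequence at 1...1152) and for SEQ ID NO:10 (384 amino acids) can be checked against one another with simple arithmetic. The Python sketch below is one way to do that. The function name check_listing_lengths is a hypothetical helper, and the reading that the three nucleotides after the coding region form a stop codon is an interpretation of the layout, not a statement made in the listing.

```python
def check_listing_lengths(total_nt, cds_start, cds_end, n_residues):
    """Check that a coding region and a protein length agree.

    Returns (number of codons, nucleotides remaining after the coding region).
    Raises ValueError if the coding region is not a whole number of codons or
    if the codon count differs from the stated number of residues.
    """
    cds_len = cds_end - cds_start + 1
    if cds_len % 3 != 0:
        raise ValueError("coding region is not a whole number of codons")
    codons = cds_len // 3
    if codons != n_residues:
        raise ValueError("codon count does not match the residue count")
    return codons, total_nt - cds_end


# Values taken from the entries for SEQ ID NO:9 and SEQ ID NO:10.
codons, trailing = check_listing_lengths(1155, 1, 1152, 384)
print(codons, trailing)
```

With these inputs the coding region spans 384 codons and leaves three trailing nucleotides, which matches the final "T" of the last full row (count 1153) followed by "GA" (count 1155).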



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAA AAG TCT 48
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

GAA ACC GAC AAC ATC AAG CGC GTA CCC TGC GAG ACA CCG CCC TTC ACT 96
Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

GTC GGA GAA CTC AAG AAA GCA ATC CCA CCG CAC TGT TTC AAA CGC TCG 144
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

ATC CCT CGC TCT TTC TCC TAC CTC ATC TGG GAC ATC ATC ATA GCC TCC 192
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

TGC TTC TAC TAC GTC GCC ACC ACT TAC TTC CCT CTC CTC CCT CAC CCT 240
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

CTC TCC TAC TTC GCC TGG CCT CTC TAC TGG GCC TGC CAG GGC TGC GTC 288
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

CTA ACC GGC GTC TGG GTC ATA GCC CAC GAG TGC GGC CAC CAC GCC TTC 336
Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

AGC GAC TAC CAG TGG CTG GAC GAC ACC GTC GGC CTC ATC TTC CAC TCC 384
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

TTC CTC CTC GTC CCT TAC TTC TCC TGG AAG TAC AGT CAT CGA CGC CAC 432
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

CAT TCC AAC ACT GGC TCC CTC GAG AGA GAC GAA GTG TTT GTC CCC AAG 480
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

AAG AAG TCA GAC ATC AAG TGG TAC GGC AAG TAC CTC AAC AAC CCT TTG 528
Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

GGA CGC ACC GTG ATG TTA ACG GTT CAG TTC ACT CTC GGC TGG CCT TTG 576
Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

TAC TTA GCC TTC AAC GTC TCG GGG AGA CCT TAC GAC GGC GGC TTC GCT 624
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

TGC CAT TTC CAC CCC AAC GCT CCC ATC TAC AAC GAC CGT GAG CGT CTC 672
Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

CAG ATA TAC ATC TCC GAC GCT GGC ATC CTC GCC GTC TGC TAC GGT CTC 720
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

TAC CGC TAC GCT GCT GTC CAA GGA GTT GCC TCG ATG GTC TGC TTC TAC 768
Tyr Arg Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

GGA GTT CCT CTT CTG ATT GTC AAC GGG TTC TTA GTT TTG ATC ACT TAC 816
Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

TTG CAG CAC ACG CAT CCT TCC CTG CCT CAC TAT GAC TCG TCT GAG TGG 864
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

GAT TGG TTG AGG GGA GCT TTG GCC ACC GTT GAC AGA GAC TAC GGA ATC 912
Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

TTG AAC AAG GTC TTC CAC AAT ATC ACG GAC ACG CAC GTG GCG CAT CAC 960
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

CTG TTC TCG ACC ATG CCG CAT TAT CAT GCG ATG GAA GCT ACG AAG GCG 1008
Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

ATA AAG CCG ATA CTG GGA GAG TAT TAT CAG TTC GAT GGG ACG CCG GTG 1056
Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATC TAT GTG GAA CCG 1104
Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

GAC AGG CAA GGT GAG AAG AAA GGT GTG TTC TGG TAC AAC AAT AAG TTA T 1153
Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

GA 1155

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 





Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

Tyr Arg Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

Ile


Lys 


Pro 


He 


Leu 


Gly 


Glu 


Tyr Tyr 


Gin Phe 


Asp Gly Thr Pro Val 








340 
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Val 


Lys 


Ala 


Met 


Trp 


Arg 


Glu 


Ala Lys 


Glu Cys 


He Tyr Val Glu Pro 
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360 




365 




Asp 


Arg 
370 


Gin 


Gly 


Glu 


Lys 


Lys 
375 


Gly Val 


Phe Trp 


Tyr Asn Asn Lys Leu 
380 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1155 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1152 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAA AAG TCT 48 
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15 

GAA ACC GAC AAC ATC AAG CGC GTA CCC TGC GAG ACA CCG CCC TTC ACT 96
Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30 

GTC GGA GAA CTC AAG AAA GCA ATC CCA CCG CAC TGT TTC AAA CGC TCG 144 
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45 

ATC CCT CGC TCT TTC TCC TAC CTC ATC TGG GAC ATC ATC ATA GCC TCC 192
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60 

TGC TTC TAC TAC GTC GCC ACC ACT TAC TTC CCT CTC CTC CCT CAC CCT 240 
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro 
65 70 75 80 

CTC TCC TAC TTC GCC TGG CCT CTC TAC TGG GCC TGC CAG GGC TGC GTC 288 
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95 

CTA ACC GGC GTC TGG GTC ATA GCC CAC AAG TGC GGG CAC CAC GCC TTC 336 
Leu Thr Gly Val Trp Val Ile Ala His Lys Cys Gly His His Ala Phe
100 105 110

AGC GAC TAC CAG TGG CTG GAC GAC ACC GTC GGC CTC ATC TTC CAC TCC 384
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125 

TTC CTC CTC GTC CCT TAC TTC TCC TGG AAG TAC AGT CAT CGA CGC CAC 432 
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140 

CAT TCC AAC ACT GGC TCC CTC GAG AGA GAC GAA GTG TTT GTC CCC AAG 480 
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys 
145 150 155 160 

AAG AAG TCA GAC ATC AAG TGG TAC GGC AAG TAC CTC AAC AAC CCT TTG 528 
Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

GGA CGC ACC GTG ATG TTA ACG GTT CAG TTC ACT CTC GGC TGG CCT TTG 576
Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

TAC TTA GCC TTC AAC GTC TCG GGG AGA CCT TAC GAC GGC GGC TTC GCT 624
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala 
195 200 205 

TGC CAT TTC CAC CCC AAC GCT CCC ATC TAC AAC GAC CGT GAG CGT CTC 672 
Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220 

CAG ATA TAC ATC TCC GAC GCT GGC ATC CTC GCC GTC TGC TAC GGT CTC 720 
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240 

TAC CGC TAC GCT GCT GTC CAA GGA GTT GCC TCG ATG GTC TGC TTC TAC 768 
Tyr Arg Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255 

GGA GTT CCT CTT CTG ATT GTC AAC GGG TTC TTA GTT TTG ATC ACT TAC 816 
Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270 

TTG CAG CAC ACG CAT CCT TCC CTG CCT CAC TAT GAC TCG TCT GAG TGG 864 
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285 

GAT TGG TTG AGG GGA GCT TTG GCC ACC GTT GAC AGA GAC TAC GGA ATC 912 
Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300 

TTG AAC AAG GTC TTC CAC AAT ATC ACG GAC ACG CAC GTG GCG CAT CAC 960 
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320 

CTG TTC TCG ACC ATG CCG CAT TAT CAT GCG ATG GAA GCT ACG AAG GCG 1008 
Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala 
325 330 335 

ATA AAG CCG ATA CTG GGA GAG TAT TAT CAG TTC GAT GGG ACG CCG GTG 1056 
Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATC TAT GTG GAA CCG 1104 
Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365 

GAC AGG CAA GGT GAG AAG AAA GGT GTG TTC TGG TAC AAC AAT AAG TTA T 1153 
Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380 

GA 1155

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

Glu Thr Asp Asn Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

Leu Thr Gly Val Trp Val Ile Ala His Lys Cys Gly His His Ala Phe
100 105 110

Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

Tyr Arg Tyr Ala Ala Val Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 1...1152
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAG AAG TCT 48
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15 

GAA ACC GAC ACC ATC AAG CGC GTA CCC TGC GAG ACA CCG CCC TTC ACT 96 
Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30 

GTC GGA GAA CTC AAG AAA GCA ATC CCA CCG CAC TGT TTC AAA CGC TCG 144
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45 

ATC CCT CGC TCT TTC TCC TAC CTC ATC TGG GAC ATC ATC ATA GCC TCC 192 
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60 

TGC TTC TAC TAC GTC GCC ACC ACT TAC TTC CCT CTC CTC CCT CAC CCT 240 
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro 
65 70 75 80 

CTC TCC TAC TTC GCC TGG CCT CTC TAC TGG GCC TGC CAA GGG TGC GTC 288 
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95 

CTA ACC GGC GTC TGG GTC ATA GCC CAC GAG TGC GGC CAC CAC GCC TTC 336 
Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110 

AGC GAC TAC CAG TGG CTT GAC GAC ACC GTC GGT CTC ATC TTC CAC TCC 384 
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125 

TTC CTC CTC GTC CCT TAC TTC TCC TGG AAG TAC AGT CAT CGA CGC CAC 432
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

CAT TCC AAC ACT GGC TCC CTC GAG AGA GAC GAA GTG TTT GTC CCC AAG 480 
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys 
145 150 155 160

AAG AAG TCA GAC ATC AAG TGG TAC GGC AAG TAC CTC AAC AAC CCT TTG 528
Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175 

GGA CGC ACC GTG ATG TTA ACG GTT CAG TTC ACT CTC GGC TGG CCG TTG 576 
Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190 

TAC TTA GCC TTC AAC GTC TCG GGA AGA CCT TAC GAC GGC GGC TTC GCT 624
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

TGC CAT TTC CAC CCC AAC GCT CCC ATC TAC AAC GAC CGC GAG CGT CTC 672
Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

CAG ATA TAC ATC TCC GAC GCT GGC ATC CTC GCC GTC TGC TAC GGT CTC 720 
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240 

TTC CGT TAC GCC GCC GCG CAG GGA GTG GCC TCG ATG GTC TGC TTC TAC 768 
Phe Arg Tyr Ala Ala Ala Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255 

GGA GTC CCG CTT CTG ATT GTC AAT GGT TTC CTC GTG TTG ATC ACT TAC 816 
Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270 

TTG CAG CAC ACG CAT CCT TCC CTG CCT CAC TAC GAT TCG TCC GAG TGG 864 
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285 

GAT TGG TTG AGG GGA GCT TTG GCT ACC GTT GAC AGA GAC TAC GGA ATC 912 
Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300 

TTG AAC AAG GTC TTC CAC AAT ATT ACC GAC ACG CAC GTG GCG CAT CAT 960 
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

CTG TTC TCC ACG ATG CCG CAT TAT CAC GCG ATG GAA GCT ACC AAG GCG 1008 
Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala 
325 330 335 

ATA AAG CCG ATA CTG GGA GAG TAT TAT CAG TTC GAT GGG ACG CCG GTG 1056 
Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350 

GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATC TAT GTG GAA CCG 1104
Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365 

GAC AGG CAA GGT GAG AAG AAA GGT GTG TTC TGG TAC AAC AAT AAG TTA T 1153 
Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

GA 1155 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

Phe Arg Tyr Ala Ala Ala Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1152
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAG AAG TCT 48 
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15 

GAA ACC GAC ACC ATC AAG CGC GTA CCC TGC GAG ACA CCG CCC TTC ACT 96 
Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30 

GTC GGA GAA CTC AAG AAA GCA ATC CCA CCG CAC TGT TTC AAA CGC TCG 144 
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45 

ATC CCT CGC TCT TTC TCC TAC CTC ATC TGG GAC ATC ATC ATA GCC TCC 192 
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60 

TGC TTC TAC TAC GTC GCC ACC ACT TAC TTC CCT CTC CTC CCT CAC CCT 240 
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80 

CTC TCC TAC TTC GCC TGG CCT CTC TAC TGG GCC TGC CAA GGG TGC GTC 288 
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95 

CTA ACC GGC GTC TGG GTC ATA GCC CAC GAG TGC GGC CAC CAC GCC TTC 336 
Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110 

AGC GAC TAC CAG TGG CTT GAC GAC ACC GTC GGT CTC ATC TTC CAC TCC 384 
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125 

TTC CTC CTC GTC CCT TAC TTC TCC TGG AAG TAC AGT CAT CGA CGC CAC 432 
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His 
130 135 140 

CAT TCC AAC ACT GGC TCC CTC GAG AGA GAC GAA GTG TTT GTC CCC AAG 480 
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys 
145 150 155 160 

AAG AAG TCA GAC ATC AAG TGG TAC GGC AAG TAC CAC AAC AAC CCT TTG 528
Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr His Asn Asn Pro Leu
165 170 175 

GGA CGC ACC GTG ATG TTA ACG GTT CAG TTC ACT CTC GGC TGG CCG TTG 576 
Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

TAC TTA GCC TTC AAC GTC TCG GGA AGA CCT TAC GAC GGC GGC TTC GCT 624 
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala 
195 200 205 

TGC CAT TTC CAC CCC AAC GCT CCC ATC TAC AAC GAC CGC GAG CGT CTC 672 
Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220 

CAG ATA TAC ATC TCC GAC GCT GGC ATC CTC GCC GTC TGC TAC GGT CTC 720 
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

TTC CGT TAC GCC GCC GCG CAG GGA GTG GCC TCG ATG GTC TGC TTC TAC 768
Phe Arg Tyr Ala Ala Ala Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

GGA GTC CCG CTT CTG ATT GTC AAT GGT TTC CTC GTG TTG ATC ACT TAC 816 
Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270 

TTG CAG CAC ACG CAT CCT TCC CTG CCT CAC TAC GAT TCG TCC GAG TGG 864 
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285 

GAT TGG TTG AGG GGA GCT TTG GCT ACC GTT GAC AGA GAC TAC GGA ATC 912 
Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300 

TTG AAC AAG GTC TTC CAC AAT ATT ACC GAC ACG CAC GTG GCG CAT CAT 960 
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320 

CTG TTC TCC ACG ATG CCG CAT TAT CAC GCG ATG GAA GCT ACC AAG GCG 1008 
Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala 
325 330 335 

ATA AAG CCG ATA CTG GGA GAG TAT TAT CAG TTC GAT GGG ACG CCG GTG 1056 
Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350 

GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATC TAT GTG GAA CCG 1104 
Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365 

GAC AGG CAA GGT GAG AAG AAA GGT GTG TTC TGG TAC AAC AAT AAG TTA T 1153 
Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380 

GA 1155 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr His Asn Asn Pro Leu
165 170 175

Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

Phe Arg Tyr Ala Ala Ala Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Gly Ile
290 295 300

Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1152 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG GGT GCA GGT GGA AGA ATG CAA GTG TCT CCT CCC TCC AAG AAG TCT 48 
Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

GAA ACC GAC ACC ATC AAG CGC GTA CCC TGC GAG ACA CCG CCC TTC ACT 96
Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

GTC GGA GAA CTC AAG AAA GCA ATC CCA CCG CAC TGT TTC AAA CGC TCG 144
Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

ATC CCT CGC TCT TTC TCC TAC CTC ATC TGG GAC ATC ATC ATA GCC TCC 192
Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

TGC TTC TAC TAC GTC GCC ACC ACT TAC TTC CCT CTC CTC CCT CAC CCT 240
Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

CTC TCC TAC TTC GCC TGG CCT CTC TAC TGG GCC TGC CAA GGG TGC GTC 288
Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95



CTA ACC GGC GTC TGG GTC ATA GCC CAC GAG TGC GGC CAC CAC GCC TTC 336
Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

AGC GAC TAC CAG TGG CTT GAC GAC ACC GTC GGT CTC ATC TTC CAC TCC 384
Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

TTC CTC CTC GTC CCT TAC TTC TCC TGG AAG TAC AGT CAT CGA CGC CAC 432
Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

CAT TCC AAC ACT GGC TCC CTC GAG AGA GAC GAA GTG TTT GTC CCC AAG 480
His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

AAG AAG TCA GAC ATC AAG TGG TAC GGC AAG TAC CTC AAC AAC CCT TTG 528
Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

GGA CGC ACC GTG ATG TTA ACG GTT CAG TTC ACT CTC GGC TGG CCG TTG 576
Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

TAC TTA GCC TTC AAC GTC TCG GGA AGA CCT TAC GAC GGC GGC TTC GCT 624
Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

TGC CAT TTC CAC CCC AAC GCT CCC ATC TAC AAC GAC CGC GAG CGT CTC 672
Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

CAG ATA TAC ATC TCC GAC GCT GGC ATC CTC GCC GTC TGC TAC GGT CTC 720
Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

TTC CGT TAC GCC GCC GCG CAG GGA GTG GCC TCG ATG GTC TGC TTC TAC 768
Phe Arg Tyr Ala Ala Ala Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

GGA GTC CCG CTT CTG ATT GTC AAT GGT TTC CTC GTG TTG ATC ACT TAC 816
Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

TTG CAG CAC ACG CAT CCT TCC CTG CCT CAC TAC GAT TCG TCC GAG TGG 864
Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

GAT TGG TTG AGG GGA GCT TTG GCT ACC GTT GAC AGA GAC TAC GAA ATC 912
Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Glu Ile
290 295 300

TTG AAC AAG GTC TTC CAC AAT ATT ACC GAC ACG CAC GTG GCG CAT CAT 960
Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320 

CTG TTC TCC ACG ATG CCG CAT TAT CAC GCG ATG GAA GCT ACC AAG GCG 1008 
Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala 
325 330 335 

ATA AAG CCG ATA CTG GGA GAG TAT TAT CAG TTC GAT GGG ACG CCG GTG 1056 
Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350 

GTT AAG GCG ATG TGG AGG GAG GCG AAG GAG TGT ATC TAT GTG GAA CCG 1104 
Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365 

GAC AGG CAA GGT GAG AAG AAA GGT GTG TTC TGG TAC AAC AAT AAG TTA T 1153 
Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380 

GA 1155 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Gly Ala Gly Gly Arg Met Gln Val Ser Pro Pro Ser Lys Lys Ser
1 5 10 15

Glu Thr Asp Thr Ile Lys Arg Val Pro Cys Glu Thr Pro Pro Phe Thr
20 25 30

Val Gly Glu Leu Lys Lys Ala Ile Pro Pro His Cys Phe Lys Arg Ser
35 40 45

Ile Pro Arg Ser Phe Ser Tyr Leu Ile Trp Asp Ile Ile Ile Ala Ser
50 55 60

Cys Phe Tyr Tyr Val Ala Thr Thr Tyr Phe Pro Leu Leu Pro His Pro
65 70 75 80

Leu Ser Tyr Phe Ala Trp Pro Leu Tyr Trp Ala Cys Gln Gly Cys Val
85 90 95

Leu Thr Gly Val Trp Val Ile Ala His Glu Cys Gly His His Ala Phe
100 105 110

Ser Asp Tyr Gln Trp Leu Asp Asp Thr Val Gly Leu Ile Phe His Ser
115 120 125

Phe Leu Leu Val Pro Tyr Phe Ser Trp Lys Tyr Ser His Arg Arg His
130 135 140

His Ser Asn Thr Gly Ser Leu Glu Arg Asp Glu Val Phe Val Pro Lys
145 150 155 160

Lys Lys Ser Asp Ile Lys Trp Tyr Gly Lys Tyr Leu Asn Asn Pro Leu
165 170 175

Gly Arg Thr Val Met Leu Thr Val Gln Phe Thr Leu Gly Trp Pro Leu
180 185 190

Tyr Leu Ala Phe Asn Val Ser Gly Arg Pro Tyr Asp Gly Gly Phe Ala
195 200 205

Cys His Phe His Pro Asn Ala Pro Ile Tyr Asn Asp Arg Glu Arg Leu
210 215 220

Gln Ile Tyr Ile Ser Asp Ala Gly Ile Leu Ala Val Cys Tyr Gly Leu
225 230 235 240

Phe Arg Tyr Ala Ala Ala Gln Gly Val Ala Ser Met Val Cys Phe Tyr
245 250 255

Gly Val Pro Leu Leu Ile Val Asn Gly Phe Leu Val Leu Ile Thr Tyr
260 265 270

Leu Gln His Thr His Pro Ser Leu Pro His Tyr Asp Ser Ser Glu Trp
275 280 285

Asp Trp Leu Arg Gly Ala Leu Ala Thr Val Asp Arg Asp Tyr Glu Ile
290 295 300

Leu Asn Lys Val Phe His Asn Ile Thr Asp Thr His Val Ala His His
305 310 315 320

Leu Phe Ser Thr Met Pro His Tyr His Ala Met Glu Ala Thr Lys Ala
325 330 335

Ile Lys Pro Ile Leu Gly Glu Tyr Tyr Gln Phe Asp Gly Thr Pro Val
340 345 350

Val Lys Ala Met Trp Arg Glu Ala Lys Glu Cys Ile Tyr Val Glu Pro
355 360 365

Asp Arg Gln Gly Glu Lys Lys Gly Val Phe Trp Tyr Asn Asn Lys Leu
370 375 380



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: Other Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGATATGATG ATGGTGAAAG A 21 
(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

TCTTTCACCA TCATCATATC C 21

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

GTTATGAAGC AAAGAAGAAA C 21

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

GTTTCTTCTT TGCTTTGCTT CATAAC 26

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

CAUCAUCAUC AUCTTCTTCG TAGGGTTCAT CG 32

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

CUACUACUAC UATCATAGAA GAGAAAGGTT CAG 33

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

CAUCAUCAUC AUCATGGGTG CACGTGGAAG AA 32

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

CUACUACUAC UATCTTTCAC CATCATCATA TCC 33